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Abstract 



The equilibrium distribution of a reversible coagulation-fragmentation process (CFP) and the 
joint distribution of components of a random combinatorial structure(RCS) are given by the same 
probability measure on the set of partitions. We establish a central limit theorem for the number 
of groups (=components) in the case a{k) = qkP~^, k > 1, q,p > 0, where a{k), fc > 1 is the 
parameter function that induces the invariant measure. The result obtained is compared with the 
ones for logarithmic RCS's and for RCS's, corresponding to the case p <0. 

1 Summary. 

Our main result is a central limit theorem (Theorem 4.6) for the number of groups at steady state 
for a class of reversible CFP's and for the corresponding class of RCS's. 

In Section 2, we provide a definition of a reversible k-CFP admitting interactions of up to k groups, as 
a generalization of the standard 2-CFP. The steady state of the processes considered is fully defined 
by a parameter function a > on the set of integers. It was observed by F. Kelly {^^, p. 183) that 
for all 2 < A; < the k-CFP 's have the same invariant measure on the set of partitions of a given 
integer A^(= the number of particles). 

Section 3 explains the idea of A. Khintchine's probabilistic method for derivation of asymptotic 

formulae. In the spirit of the method, we construct a representation of the probability function of 

the number of groups via the probability function of the sum of i.i.d. random variables. 

In section 4 we study the case when the parameter function a is of the form: a{k) = qk^^^, k > 1, 

q,p > 0. We prove a local and a central normal limit theorems for the number of groups at equilibrium, 

as ^ oo. To achieve this, we employ a new (for this field) tool: the Poisson summation formula. 

We conclude the section by providing some intuition for the main result of the paper. 

In Section 5 we recall that the invariant measure of a reversible CFP can be viewed as a joint 

distribution of components of a RCS, known as an assembly. Comparing our main result with the 

known ones for RCS's, we identify p = as a point of phase transition of the invariant measure, as 

N ^ oo. We also provide a few examples of RCS's that conform to the case p > 0. 

2 CFP's with multiple interactions: Definition. 

Following JJ, 0, we treat a CFP as a continuous-time Markov chain on the finite set = {v} of 
all partitions t] = (ni, . . . , un) of a given integer N: 



Here codes the total population of indistinguishable particles partitioned into groups(=clusters) 
of different sizes. A group of size j > 2 may split into a number, say s, 2 < s < j, of groups of sizes 



N 





(2.1) 
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ji, ■ ■ ■ , js '■ ji + ■ ■ ■ + js = j and, conversely, the above s groups may coagulate into one large group 
of size j. We will call these s-interactions(=s-transitions), Js -fragmentation and Js- coagulation 
respectively, where Js = (ji, • • • ,js)- A stochastic process that admits interactions of up to < 
groups will be denoted k-CFP. Note that both types of the interactions conserve the total number 
of particles. 

We now provide a formal definition of a k-CFP that naturally extends the definition of the standard 
2-CFP (see j7j, jllp- A k-CFP is given by the rates of infinitesimal transitions that are assumed to 
depend only on the sizes of interacting groups. For given N and s, 2 < s < k, let ips,4's > be a 
pair of functions defined on the same set 

Ss = {Js = (j'l, . . . , js) ■■ ji>0, / = 1, . . . , S, jl + ...js < N} 

of s-tuples of positive integers, depicting the sizes of interacting groups. The functions ips,4's are 
the rates of J^-coagulation and J^-fragmentation respectively. Both functions are assumed to be 
invariant w.r.t. all s! permutations of ji, . . . ,js. To complete the definition of a k-CFP, it is left to 
determine the total rates of all possible s-transitions from one partition r] G flN to another. Assume 
that the given Js = (ji, . . . , Js) G Js and r/ = (ni, . . . , titv) G are such that ^ m; > 1, I £ Js, 
where mi counts the number of components in Js that are equal to I. In other words, we assume, 
that a Js-coagulation of some groups in the partition r] is possible. Clearly, a given Js-coagulation 
of any groups in rj transforms r] into the same partition that will be denoted ry^'^") G JIat. Similarly, 
the result of Jg-fragmentation of any groups in r/ G Q]\f will be denoted r/(j^) G CIn. By a simple 
combinatorial calculation, the total rate ^s(«^s;??) of all possible Js- coagulations at the partition ij 
is 

*«(^.;^) = (n(;^fz^)^^(^^)' •^'^^J- (2.2) 

where the product is taken over all distinct components ji of Js. By the same logic we define the 
total rate ^{Js;rj) of all possible Js-fragmentations at the partition r] to be equal to 

$s(Js;??) = n\j^\^s{Js), Js G Js, (2.3) 

where | Js| = Ji + • • • + Js- Now we see that a k-CFP is fully defined by the {k — 1) pairs of functions 
i/jsAs, s = 2,...,k. 

In this paper we will be concerned only with reversible CFP's. Define the ratio of s-interactions 



QsiJs) = ' 



ifgj, if Js : ^s(Js)'/'s(Js) > 0, JsGJs, 

(2.4) 

0, otherwise. 



A natural extension of Theorem 1 in gives the following characterization of reversible k-CFP 's 
with positive transition rates. 



3 



Proposition 2.1 Let 

iPsWMJs) > 0, JseSs, s = 2,...,k. (2.5) 
Then the corresponding k-CFP is reversible iff the ratios Qs, s = 2, . . . ,k are of the form: 

g,(J,) = "^'-^^P J, = (ji,...,i,)Gj„ s = 2,...,k, (2.6) 
Ul=i aU) 

where a > is a function on the set of positive integers. 
The proof is deferred until after Proposition 2.2. 

A historical remark: It was already noted in j7j, that the characterization of reversible 2-CFP's 
was motivated by the following two completely independent lines of research: the seminal paper of 
F. Spitzer |17j on nearest-particle systems and F. Kelly's and P. Whittle's works in the 1970-s on 
networks and clustering process in polymerization (see JH). ■ 

We will call a the parameter function of a reversible CFP and we will write aj = a{j), j > 1. The 
following result gives the explicit form of the steady state of the processes. 

Proposition 2.2 /^.TTl^.Ch.S) 

For a given N, all k-CFP's, k = 2,. . . ,N satisfying h2.b]) for some parameter function a have the 
same invariant measure onVtN : 



1^n{v) = {cnT^ \ \"' 77 = (m, . . . , n;v) G ^N, (2.7) 

ni!n2! . . . un^ 

where Uj > 0, j = 1,2, . . . , N and cn = CAr(ai, . . . , a^) is the partition function of the measure fi^ '■ 

n^i n2 riM 

co=i, c^=y: "^;v" ^ ' ^^1- (2.8) 

^ ni\n2\...nN\ 

Proof: We have to show that the measure fiN given by 1)2.7^ satisfies the detailed balance 
condition: 

mivWiv, = m{OV{^, v), v, C e ^n, (2.9) 

where V{ri, ^) is the total rate of the infinitesimal (in time) transition from r] to ^. If V{r], = 0, 
then (|2.9|) is trivially true. If now S_ = rj^-^") for some Jg G Js, 2 ^ s ^ A;, then we see from (|2.7j) and 
(E21) that 



In a similar manner one can verify (|2.9)) in the case = r](j^) for some Js £ Is- 
Now we are in position to give the 



(2.10) 
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Proof of Proposition 2.1: If a k-CFP, > 2 is reversible, then it follows from [7j, Theorem 1, that 
H2.6|) should hold for s=2. The latter fact implies that the unique invariant measure of all k-CFP-s, 
k = 2, . . . , N is given by (|2.7() . Consequently, H2.6() should hold for 3 < s < A;, by Proposition 2.2. 
The same reasoning proves the converse part of the claim. ■ 

The preceding discussion shows that the steady state of a reversible k-CFP is uniquely determined 
by a parameter function a. 

3 Khintchine's type representation for the probability function of 
the number of groups. 

Our objective will be the study of the asymptotic behavior, as ^ cxd, of the number of groups 
at equilibrium given by the measure /xjy. It follows from (|2.7() and 1)2. 8() that 

As in IHl, ini, our tool will be the probabilistic method by A. Khintchine introduced in the 1950's 
in his seminal book (12) . The idea of the method is to construct the representation of the quantity 
of interest via the probability function of a sum of independent integer valued random variables 
depending on a free parameter, and subsequent implementation of a local limit theorem. This allows 
for the derivation of the desired asymptotic formula. 

In number theory, the implementation of Khintchine's method was developed by G. Freiman et al 
(for references see |15|.|n]l. In particular, a general scheme of the method for asymptotic problems 
related to partitions was outlined by G. Freiman and J. Pitman in ^Uj. The method was applied to 
CFP's for the first time in for derivation of the asymptotic formula for the partition function of 
the measure /Utv in the case ~ kP~^, k — > oo, p > 0. In 9 the method was used for the study of 
the asymptotic behavior of some quantities related to clustering of groups at the steady state, when 
Ofc ~ kP~^L(k), k — > oo, p > 0, where L is a slowly varying function. 

Though the implementation of Khintchine's method goes along the standard scheme, the related 
asymptotic analysis varies from problem to problem. In contrast to the aforementioned research, 
the problem treated in the present paper requires knowledge of the second term in the asymptotic 
expansions considered. In light of this, we employ the Poisson summation formula, a new tool for 
this field. 

We will always assume that the parameter functions a considered are positive and s.t. the power 
series 

oo 

YjUkx'', xeC (3.12) 

k=l 

has a finite radius of convergence R > 0. Since the transformation =^ h^a^, h > 0, k = 1,2, . . . , N 
does not change the measure fiN given by 1)2. 7|) . we assume w.l.g. that R = 1. 
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Now let Cii • • • ) be i.i.d. integer valued nonzero r.v.'s defined by 

nCi = l) = -^, ai>0, 1>1, (3.13) 
where 5 > is a free parameter and 

oo 

S{6)=Y,ake-''. (3.14) 

k=l 

Note that the r.v. has finite moments of all orders for all 5 > 0, since R = 1. We start with the 
following representation of the probability P(z^Ar = n). 

Lemma 3.1 Define 

n 

Tn = ^Ck, n>l, 
k=l 

where .^fc, k = 1,2, . . . are i.i.d. r.v. 's given by i'j.l'A) , fcl.l4\ )- Then 

P(i/^ = n) = (c^n!)~^5"(<5)e^^P(r„ = N), 6> 0. (3.15) 
Proof: It follows from (l3T3|l . dTTHl that 



-jv „ „N In! 



E Q-l "2 ■ • • "at 
ni!n2! . . .riAf! (E"=i"fc=") 
m. = A-) = . (3.16) 

By (113) and (|XTT|) this imphes the claim (|TT3|) . ■ 

Remark. (|3.15|) has a form of a typical representation in Khintchine's method. It can be also viewed 
as a version of the representation formula for the total number of components in the generalized 
scheme of allocation (see Lemma 1.3.3). 

Our study will be heavily based on the assumption 

oo 

^^A;afc = oo. (3-17) 

k=l 

Denote 

oo 

Mi = Mi{n;6) =BTn = nE^i = nS'^{6)'^kake-^'', 6>0 (3.18) 

k=l 

and choose the free parameter 5 as a solution of the equation 

Mi = N, n<N. (3.19) 

Such a choice of the free parameter is typical for Khintchine's method (m], p. 110) and is designed 
to make the probability P(T„ = N) in ()3.15|) large, as n,N ^ oo, n < N. 
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Lemma 3.2 Under condition the equation i3.iy\) has a unique solution 6 = 6n,N for all 

n< N. 

Proof: We first show that Mi is decreasing in 5 > : 

-S{S) Er=i k'a.e-^'^ + ( ^^=1 ka^e-''') ' 
M (n; 6)=n -^—^ ^ < 0, 6> 0, (3.20) 

by the Cauchy-Schwarz inequality. Consequently, Mi(n;0) = sup Mi (n; 5) := nMi where does 

not depend on n. If now S{0) < oo, then (|3.17|) immediately implies = oo. In the case S{Q) = oo, 
supposing < oo leads to the contradiction: 



> Mi{n-5) -uMl = n- 



S{6) 



> nS'\6) i A{6) + - kake-^''], 6 > 0, (3.21) 

y k>2M* / 



where A(5) is bounded for any 6 > 0, while the sum in the brackets tends to +oo as (5 — > 0"^, by 
(|TT7|) . Hence, Mi(n;0) = +oo. Finally, 

Ml (n; oo) = n lim ^'=' = n. (3.22) 



4 A central limit theorem for the number of groups. 

Our paper is devoted exclusively to the case when the parameter function a has a polynomial rate 
of growth, namely: 

ai, = qkP-^, q,p>0, k > 1. (4.23) 

We first consider the case q = 1. The following lemma which is basic for our subsequent asymptotic 
analysis is a particular case of the Poisson summation formula (see jH] , [2| ) • 

Lemma 4.1 (J^, p. 82). Let p > 1 and Re{z) > 0. Then we have 

^ e-"^F-i = r(p) ( 5^ (^ + 27rz/)-f J . (4.24) 

fe=l ^/=-oo ^ 

With the help of this remarkable identity we derive the following asymptotic formula that holds for 
p > 0. 
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Lemma 4.2 If p > 0, \z\ 0, Re{z) 0+, then 

oo 

e^'^kP-^ = T{p) (z-P + A{p) \ + 0{z), (4.25) 



fe=i 

where A{p) is a constant which in the case p > 1 is given explicitly by 

A(p) = 2(27r)-^C(p)cos^. (4.26) 
(Here ({p) is the Riemann zeta function). 
Proof: First consider the case p > 1. Let 

— 1 oo 

G{z) = Y {z + 2TTiiyP + 'Y{z + 2TTiiy'P, Re{z)>0, p>l. (4.27) 

/=— oo 1=1 

The two series in the RHS of (|4.27|) converge absolutely when Re{z) > 0, while, by straightforward 
calculations, G(0) = A{p), G'(0) = -pA{p + 1) with A{p) given by Consequently, by 

we have for p > \ 

e-^'^F-i - Tip)z~P = r(p)G(z) = r(p) (ciO) + G'(0)z + o(z)') , 
fc=i ^ ^ 

\z\ 0, i?e(z) ^ 0+. (4.28) 

This proves (|4.25|1 for p > 1. So, we write for p > 

oo 

Ye''''kP = r{p + l)(^z~P-^ + A{p+l)^+0{z), \z\^0, Reiz)^0+. (4.29) 

k=l 

Next, integrating 1)4. 29() w.r.t. z gives ()4.25() with a constant A(p) that is not known explicitly. ■ 
Now we are in a position to derive the asymptotic formula for the free parameter 5. 

Proposition 4.3 Assume that n is s.t. a := ^ — > oo, as N ^ oo. Then 

6 = pa-^(l- A{p)fa-A +o{a"P-^), a ^ oo. (4.30) 



Proof: First observe that in the case of the function a considered, it follows from ()3.18|) . H3.19() and 
the proof of Lemma 3.2, that a = — ^ oo implies 5n,N O"*". Implementing 1)4. 25(1 gives 



6-P-' + Aip + l)+0{5) _ 

k=l 

p6-^(l- A{p)SP + o((5P)) , 5^0+, p> 0. 



(4.31) 
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Now (|XT^ leads to 



5 = pa-^ [l - A{p)5P + o{6P)j , 6 = 6n,N ^ 0+ , a ^ oo. (4.32) 
Iterating this equation w.r.t. 6 gives ()4.3U() . ■ 

We will focus now on asymptotics, as a ^ oo, of the probability P(T„ = N) under 5 given by H4.3U() . 
We have 

P(r„ = N) = (27r)-i <^(t)e-**^dt) , (4.33) 
where ip is the characteristic function of T„. 

The basic idea of the Khintchine's method is that for a wide class of models, choosing the free 
parameter from the condition (|3.19|) guarantees that the main contribution to the integral in the 
RHS of (|4.33|) comes from a set which is some neighborhood of zero. 

We will demonstrate that this is in effect true in the case considered and will prove the normal 
local limit theorem for the sum T„, as a — > oo. As a preliminary step, we verify the validity of the 
Lyapunov's condition 

lim ^ = 0, (4.34) 

where M2 = M2{n;N), M3 = M^{n;N) are correspondingly the variance and the third central 
moment of T„ under 5 given by (|4.3()|) . 
We have 

00 

M2 = VarTn = nVar^i = n(^S-\5) ^ k^akc'^^ - a^) , (5 > (4.35) 

k=l 

and 

M3 = E{Tn - Nf = nE{ii - af = n{Eil - SaE^f + 2a^). (4.36) 

Applying now ()4.25|) and ()4.3U|) gives the following asymptotic expressions for the moments consid- 
ered: 

M2 ~ p(p + 1)(5~^ =np6~^, 

M3 n(^p{p+l){p + 2)5-^ -3p'^{p + l)5-^ + 2p^6-^^ =2p5-^n, n,a^oo. (4.37) 
This proves ()4.34|1 . The condition 1)4. 34(1 provides the existence of /3 = P{n,d) > s.t. 

/J^Ma ^ 00, I3^M3 ^0, n,a^ 00. (4.38) 

Explicitly, in view of (|4.37j) . 

P = Sn-^, < e < 1/3 (4.39) 
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satisfies ()4.38() . As it will be shown below, [—/?,/?] is just the required neighborhood of zero. 



Lemma 4.4 (The local limit theorem for Tn-) 
Set 

n = QpNl^+sN^, Qp =P""^(r(p + 1))^, s(^R, p>0. (4.40) 

Then 

Proof: We write 

I = h+h. (4.42) 

/TT 
ip{t)e~'^^dt and/1,/2 are integrals of the integrand of I taken over the sets [— /3, /3] 
-TT 

and [— TT, — /3] |J[/5, vr] respectively. 

Step 1. We find the asymptotics of the integral Ii, when /3 is as given by ()4.39p . 
By the definition of a, 

^{t)e-''^ = ^Ut), teR, (4.43) 

where (pi is the characteristic function of the r.v. ^1 — a. Next, denoting by ip2 the characteristic 
function of the r.v. (p"^/^^) (^1 - q), we get from (|0^ - (|07|l 

^2(t) = l-^t' + 0(t'), t^O. (4.44) 
Combining this with the relationship 

ip2{t) = ipi{P~^^^St), t^R, (4.45) 

1)4. 43|) becomes 

1 



^(t)e-^*^ = ( 1 - -{Vp6-Hf + 0{6-H^) 



exp{ -^{^6-^tf + nO{5-^t^)), td'^ ^ 0, n^oo. (4.46) 



Consequently, by virtue of ()4.37() and 1)4. 38() . 



/ 27r 

Step 2. We are to show that 

h = o(/i), N^oo, p>0. (4.48) 
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We apply with z = 6 - it, t e [P, vr] to obtain from (|^^ the following analog of 

e-^^F-i = T{p) (z~P + G{-it) + (5G'(-it)') + o{6), p>l, t G [/?, tt], 5 ^ 0+, (4.49) 
k=i ^ ^ 

where \G{it)\, \G'{it)\ < const := B{p), t G [/3, vr], p > 1. By the same argument as for (|4.29p the 
latter yields 

In the rest of this section it is always assumed that n is specified as in 1)4. 4U() . In view of H4.4U() we 
have 



na-P = ^=p-Pr{p) + eN, P > 0, N ^ oo, (4.51) 



where 



eN = ip+l)QPsN~^ + -^^-^^ ^ QP-^s^N~^ + 0{N'^), p>0, N ^ oo. (4.52) 
This fact will be repeatedly used in our subsequent asymptotic analysis. Consequently, 

nSP = r{p) +pPeN + 0{a-P), p >0, N ^ oo. (4.53) 
Employing 1)4.53(1 we further obtain from (|4.50() and (|4.39|) 

L(t)r <o(^exp(-|n^)^, t€[(3,7T], p>0, n^oo, (4.54) 



for any < e < |. This together with (|07|) proves ■ 
To establish our main result, it is left to find the asymptotic formulae for the rest of the factors in 
(|3.15() . First we make use of the following result of |S]: 



1 / ^ \ 

CN ; exp [Na + y" F'^e'^'" , iV ^ oo, p>0, (4.55) 

where 

<J = aNr^ ( I>Tl) J ' ^ ^ ^ ^ °° ^^•^'^^ 
is the unique solution of the equation 

N 

e-'^'^kP = N, p>0, (4.57) 

k=i 

while 

TV 

= ^ e-'"'kP+\ p>0. (4.58) 



k=l 
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For our purpose, we need to know the second term in the asymptotic expansion ()4.56|) of a. In what 
follows we denote by eTv different quantities tending to zero, as ^ oo. By our asymptotic formula 



^k^e-^'' = T{p + l)(a-^^^^^ +A{p + l)\+0{a), p > 0, N ^ oo, 
k=i ^ ^ 



(4.59) 



where A{p) is as in Lemma 4.2, while, by the Euler summation formula, we have for any 7 > 0, and 
a = £7 AT given by (|4.56|) . 



V Fe-'='^~/ xPe-^'^dx = a-^P+^U x^e-^dx = o(iV-T), p > 0, N ^ 00. (4.60) 

k=N+l -^^+1 JcriN+l) 

In view of ()4.59|) . H4.60() . the equation 1)4. 57(1 can be rewritten now as 

T{p + 1) ( + A{p + 1) ) + 0{a) = N. (4.61) 



Consequently, we get 

/ N \ "FTT 

-=(r^-^(^ + i) + ^-j = 

r(p+l)')'^iV-^ + ^l(^f^:^')^ + o(iV-fi?), p>0, iV^oo. (4.62) 
J p+1 \ N J 

This yields 

Na = N^^{T{p + l))^ +eN, p > 0, N ^ 00. (4.63) 
Next, we obtain from (^351) . (P38|l . (lOfij) and (I02|) 

^N-[j^^^y^'nP + '^), P>0, iV^oo (4.64) 

and 

N 00 



k=l k=l 

^(P\ r{p+1) ) + r(p)^(p) + eiv, P>0, N^oo. (4.65) 
Substituting the above expressions in (|4.55() gives the desired asymptotic formula for cat: 

P+2 p 

CN ^ hiN 2{p+i) exp(/i2iVf+i + /is), p>0, N ^ 00, (4.66) 
where the constants hi, i = 1,2,3 are given by 



hi 



r(p + i)j 
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h2 = {p+ l)Qp, 

h3 = T{p)A{p), p>0. (4.67) 

Next, (|I3T|1 . (|i:in|l and give 

exp{5N) r^exp(^pn-A{p)T{p+l)^, p>0, N ^ oo, (4.68) 

while ()4.53p gives 

S^{S) = (t{p)6-p + A{p)T{p) + 0(5)) " ~ (r(p)) "d^^'" exp (^A{p)T{p)^ , 

p>0, N ^oo. (4.69) 

We again apply (|4.51l) to get 



§-P^ ^ exp \^A{p)T{p + 1) ) , 

n'^p^'^expl -n(il|y6^-l(T5y)'^^ + 0(6?,))) 



r(p) 



, p>0, N ^oo, (4.70) 



where eAr is given by (|4.52|) . Observing that ne^ ^0, ^ cxo, we write out now the asymptotic 

2 

N 



expressions for ne^ and nej^ to obtain 



5"(5) ~ exp - ^ - s{p + l)A^5^fe + ^(p) (r(p) + r(p + 1))) , 



dp = ^, P>0, iV^oo. (4.71) 



Finally, substituting in (|3.15|) the preceding asymptotic expansions and employing Stirling's asymp- 
totic formula gives the following result. 

Theorem 4.5 (The local limit theorem for un)- 
Let n he given as in \4--4^ - Then 

1 - V f \ 

P(z.^ = n) ; exp - — := /(iV; s), s G i?, p > 0, iV ^ oo. (4.72) 

This leads to our main result that says that almost all the mass of the probability distribution of 

V V 

the r.v. i^iv is concentrated , as ^ cxd, in a neighborhood of size 0{N^p+'^) of the point QpNp+^ . 
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Theorem 4.6 (The central limit theorem for )■ 

Uf, - QpN^ 



N{0,1), p>0, N^oo, (4.73) 



where =^ denotes weak convergence and dp,Qp are as in J^. 71[ ), fl-40{ ) respectively. 

Proof: We provide a sketch of the proof that is done by the implementation of the standard technique 
of passing from the local theorem to the integral theorem (for more details see f|16j. p. 59, [S], p. 81). 
It follows from (|I?7^ that for any a < 6, a,b G R, 

p 



sGRn 

where f{N; s) is given by ()4.72|) and 



^^^G[a,b]) = fiN;s), p>0, iV ^ oo, (4.74) 



V 

]\[2p+2 



= {se [a, b]:n = QpNp+^ + sN^p+^ G N}. 
Since \Rn\ = 0(7VvT2)^ as ^ oo, we have 

exp^ - ^^N^"^ ^ j exp^-^^dx, p > 0, ^ oo. ■ (4.75) 



Remark : Since the transformation ^ h^Of^, h > 0, k > 1 of the parameter function a does not 
change the measure fiN, the results of our paper are true for Ok = h^k^'^, k > 1, h > 0, p > 0. ■ 
One more extension of Theorem 4.6 is provided by the following result. 



Theorem 4.7 ; If Ok = qk^ ^, p> 0, k >1, where q > is a constant, then 

7V(0,1), p>0, N^oo, (4.76) 



Ur, - QpNp+i 



— 1 ~ 1 

where Qp = qp+^ Qp , dp = qp+^ dp 



Proof: Denote by • the quantities related to the parameter function a. We see from (|3.13p . ()4.57j) 
and ()4.62|) that the r.v. has the same distribution as and that 



1 



(Tat ~ gp+i fjAT, N ^ CO. (4.77) 

Repeating the preceding asymptotic analysis gives the claimed change in the scaling induced by 
g > 0. ■ 

We conclude this section by providing some intuition for the scaling in the central limit Theorem 4.6. 
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For this purpose we employ the following result established in p| for the model in question. Denote 
by q^f]), q{i]) the size of the largest (resp. smallest) group in a random partition rj £ ^l^. Then 



lim F (n^ ' < q{r]) < N^^') =1, p > 0, (4.78) 



for all e > 0, while 



lim Pfgfry) > I ) 



0, if Z = 7V^, < /5 < 1, 

z-i 

Oj j , if / > 2 is a fixed number. 



exp 



(4.79) 



From (|4.78)) and (|4.79j) one may conjecture, that for large A^, the main "mass", of size 0{N), is 
partitioned into groups(=clusters) of sizes O (^Np+^^, while the rest of the mass, of size o{N), is 
partitioned into groups of small sizes. Adopting this conjecture gives the expectation of the number 
of groups as O ( — 1=0 ( Np+^] . Concerning the asymptotic variance of vn, we note that 
the relationship Var^vj^f) = 0(Ei>]\f), — > oo holds also for permutations, the Ewens sampling 
formula and for some other RCS's (see P, |E])- 



5 RCS's: examples and comparison with known results. 

It was already observed in and [S] that particular cases of the expression 1)2. 7|) for the equilibrium 
measure fiN conform to joint distributions of components of a variety of RCS's, known as assemblies. 
By a combinatorial structure (CS) of a size one means a union of nondecomposable compo- 
nents (= components) of different sizes. Formally, such a structure is given by the two sets of integers 
{pn, A^ > 1} and {ttin, A^ > 1} that count respectively the total number of instances of size A^ 
and the number of components of size A^. An example of a CS is a graph on A^ vertices treated as a 
union of its connected components. Therefore, an instance of a CS of size A^ is given by a partition 
T] = (ni, . . . , riTv) G J^AT, where is the number of components of size k in the instance. By assuming 
that for a given A^ an instance is chosen randomly from all pi^ instances, one induces a RCS that is 
completely determined by the random component size counting process, the latter being a random 
vector with values in 17 tv- With an obvious abuse of notation we denote the random vector by rj. A 
remarkable fact in the theory of RCS's is that a great variety of them obey the conditioning relation 

N 

C{r])=£{Zi,...,ZN\^jZj = N), (5.1) 
i=i 

where Zi, . . . , Z]\f are independent integer valued r.v.'s. In particular, a class of CS's known as 
assemblies is characterized by the fact that Z^. are Poisson r.v.'s s.t. ~ Po{mk/k\), k = 
1,2, . . . , N. Consequently, by a straightforward calculation we find from (|5.1|) that for assemblies 

P(r/) = fiNiv), V e ^N, (5.2) 



15 



with Ofc = ^ and = A; = 1, 2, . . . , A^. In particular, the case ~ 9{k — 1)!, ^ > conforms 
to logarithmic RCS's that encompass permutations {rrik = {k — 1)!) and the Ewens sampling formula 
(mfc = 9{k — 1)1, 6 > 0). The novel theory of general logarithmic RCS's is presented in [J. Prom 
the analytical point of view, the common feature of logarithmic RCS's is that they do not obey the 
condition p > adopted in the present paper. As a result, our asymptotic analysis based on the 
Poisson summation formula is not applicable for logarithmic RCS's. 

The case of permutations has a long history. Por this case, the seminal result by V.L. Goncharov 
(1944) and L. Shepp and S. Lloyd (1966) states the following central limit theorem for i^jy: 

VlogiV 

A version of ()5.3|) for general logarithmic RCS's is also known. 

L. Mutafchiev (jl4j) proved a local limit theorem for i^tv under some assumptions on the asymptotic 
behavior as x — > 1 of the generating function of the sequence {c„}f° . It can be verified that in the 
case p > the assumptions (2.7) and (2.8) in [Tlj do not hold. However, Mutafchiev (|Tlj,p.425) 

conjectured, that a result similar to his Theorem 2.4 holds for a wider class of RCS's. Our Theorem 

p 

4.6 confirms this conjecture. In fact, in the case p > 0, we have Ei^^r ~ QpNp+^ and it is not hard 

p 

to see that, in agreement with the claim in |14j, QpNp+'^ ~ 5(6"*^^), N ^ oo, where aN is as in 

(iraii . 

A. Barbour and B. Granovsky (^) explored the case when in (|5.1|) are quite general r.v.'s obeying 
EZjfc ^ kP^^, k oo, p < 0. (This includes assemblies with m^. ~ kP~^k], k — > oo, p < 0). It was 
shown in ( 4 ) that such RCS's exhibit a completely different asymptotic behavior. In particular, in 
this case z^at is finite with probability 1. 

Hence, comparing the asymptotic behavior of i^n (e.g. Ei/jv) in the cases p < and p > 0, one sees 
that p = can be viewed as a point of phase transition for the measure fiN as N ^ oo. 

In conclusion, we provide a few examples of assemblies that conform to the setting of the present 
paper: 

Example 1. Forests of labelled and colored linear trees. A linear tree (see |5|) is a graph 
with no cycles, where each vertex has no more than two neighbors. Assuming that a vertex is labelled 
and is colored into one of q {q > 1) colors gives = q^kl, ak = q^, k > 1. So, by the remark 
following Theorem 4.6, this RCS corresponds to the case p = 1. 

Example 2. Forests of labelled rooted linear trees. In this case we have = kkl, 
k > 1, which gives ak = k, k > 1, that corresponds to the case p = 2. 

Example 3. Compositions, (see ^S]) Consider a space Ttv of ordered m-tuples 
X = (xi, . . . , Xm), m = 1, . . . ,N, where are positive integers, summing to N. In other words, Tjv 
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is a space of all ordered partitions (=compositions) of N. We define the probability measure Xjy on 
Tiv: 

AAr(x) = - — X = (xi, . . . G Tat, (5.4) 
(m(xjj! 

where (rjv)~^ is the normalizing constant and m = m(x) is the number of components of x. This 
means that Xn prescribes to x G Tn the weight (^jn{x)y. • I-^^note Ui = nj(x), i = 1, . . . , N the number 
of components of x G Yn that are equal to i. Then we have 



^1 ^1 ™l 1 

E— \^ 1 — — — "S^ (b 5) 

ml ^ ^ ml ^ ni!...nm! ^ ni!...nAr!' 

m=l xGTn: m(x)=m m=l ?jgr2jv:|»?|=m r^GQjv 

where, as before, Oat is the set of all (unordered) partitions of N, while \r]\ = ni + . . . + n]\f. H5.5() says 
that Tjv = Cat, where cat is the partition function of the measure fiN in the case = 1, k = 1, . . . , N . 
Therefore, we can apply our results for p = 1 to the number of summands in the random composition 
drawn according to A at- 
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